Mining Indirect Associations in Web Data
نویسندگان
چکیده
ABSTRACT Analysis of asso iation is an important Web mining te hnique be ause it an provide useful insight into the navigational behavior of Web users. E-tailers an use this information to develop strategi marketing plans and to re-stru ture their Web site in order to enhan e the browsing experien e of their ustomers . Previous work on mining Web asso iations has fo used primarily on nding frequent a ess patterns in the data. These patterns an be generated by Web users who share similar information goals or by those with varying interests. Sin e Web asso iation patterns onsider only o-o urren es in data, it is diÆ ult to identify patterns generated by one group of Web users but not by the others. Another drawba k of the existing approa h is that it does not adequately address the impa t of Web site stru ture on the support of a Web page. As a result, the majority of Web asso iation patterns dis overed using onventional te hniques ontain the home page or other referen e pages that have multiple outgoing links. In this study, we apply a new mining te hnique alled indire t asso iation to Web usage data. This novel te hnique is apable of ombining the various asso iation patterns into a more ompa t stru ture. It an also apture both positive and negative orrelations that exist in the data. We demonstrate the appli ability of this te hnique on Web data from both ommer ial and resear h institutions. Our analysis shows very promising results, espe ially in terms of identifying Web users with distin t interests.
منابع مشابه
Indirect Positive and Negative Association Rules in Web Usage Mining
One of the purposes of Web usage mining is to find out interesting user association rules from web server logs. It has become vital for personalization, effective web site management, business and support services, creating adaptive web sites, and so on. In the web domain, items correspond to pages and transactions to user sessions. Indirect associations, type of infrequent pattern provide usef...
متن کاملAutomatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...
متن کاملEfficient Mining of Indirect Associations Using HI-Mine
Discovering association rules is one of the important tasks in data mining. While most of the existing algorithms are developed for efficient mining of frequent patterns, it has been noted recently that some of the infrequent patterns, such as indirect associations, provide useful insight into the data. In this paper, we propose an efficient algorithm, called HI-mine, based on a new data struct...
متن کاملMining Indirect Association between Itemsets
Discovering association rules is one of the important tasks in data mining. While most of the existing algorithms are developed for efficient mining of frequent patterns, it has been noted recently that some of the infrequent patterns, such as negative associations and indirect associations, provide useful insight into the data. Existing indirect association mining algorithms mine indirect asso...
متن کاملExploring Biomolecular Literature with EVEX: Connecting Genes through Events, Homology, and Indirect Associations
Technological advancements in the field of genetics have led not only to an abundance of experimental data, but also caused an exponential increase of the number of published biomolecular studies. Text mining is widely accepted as a promising technique to help researchers in the life sciences deal with the amount of available literature. This paper presents a freely available web application bu...
متن کاملMining Indirect Association Rules for Web Recommendation
Classical association rules, here called “direct”, reflect relationships existing between items that relatively often co-occur in common transactions. In the web domain, items correspond to pages and transactions to user sessions. The main idea of the new approach presented is to discover indirect associations existing between pages that rarely occur together but there are other, “third” pages,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001